Method of LP-based blind restoration for improving intelligibility of bone-conducted speech
نویسندگان
چکیده
Bone-conducted (BC) speech in an extremely noisy environment is stable against surrounding noise so that it may be able to be used instead of air-conducted (AC) speech for communication. However, it has very poor sound quality and its intelligibility is degraded when transmitted through bone conduction. Therefore, voice-quality and the intelligibility of BC speech need to be blindly improved in actual speech communication and this is a challenging new topic in the speech signalprocessing field. We proposed an LP-based model to restore BC speech to improve its voice-quality in a previous study. While other methods such as Long-term Fourier transform need to use numerous AC speech parameters to restore BC speech, the proposed model can blindly restore BC speech by predicting BCLP coefficients from AC-LP coefficients. We improved the proposed model by (1) extending long-term processing to framebasis processing, (2) using LSF coefficients on LP representation, and (3) using a recurrent neural network for predicting parameters. We evaluated the improved model in comparison with other models to find out whether the model could adequately improve voice quality and the intelligibility of BC speech, using objective measures (LSD, MCD, and LCD) and carrying out Modified Rhyme Tests (MRTs). An evaluation of these three improvements to the LP-based model proved the practicability of blind-BC restoration.
منابع مشابه
Reconstruction filter design for bone-conducted speech
Bone-conducted speech is of low intelligibility, but its quality is not affected by noise. In this paper, we take into account such propertes of bone-conducted speech, and address a digital filter to reconstruct the quality of the bone-conducted speech signal obtained from a speaker. The reconstruction filter design method is derived based on a model assumption of pronunciation. Experimental re...
متن کاملHarmonics Enhancement for Determined Blind Sources Separation using Source’s Excitation Characteristics
We present an improved method on combining temporal and spectral processing approaches for multichannel determined blind sources separation. The separation task is performed by applying the spectral processing on a mixed speech, using sources’ excitation characteristics. The performance of the proposed method is investigated by separating two sources from a stereo recording mixture extracted fr...
متن کاملThe effect of redesign workstation on Speech Interference Level (SIL) among bank tellers
Abstract Background: There is always an interaction between man and his environment that can be the cause of physical, physiological and psychological stress on people and also cause discomfort, annoyance, and have direct and indirect effects on their performance and productivity, health and safety. People in their workplace are exposed to many factors related to work activities and environmen...
متن کاملSpeech intelligibility after repair of cleft lip and palate
Background: Intelligibility refers to understandability of speech; and lack of it can negatively affect children’s overall communication effectiveness. Children with repaired cleft lip and/or cleft palate (CL/P) may experience poor speech intelligibility. This study aimed at evaluating speech intelligibility in children with repaired CL/P who had not been referred to sp...
متن کاملImplementation Aspects of the Adaptive Gain Equalizer
The quality of speech, or important speech parameters such as the intelligibility, clearness or naturalness of speech, can be emphasized by signal processing. Such processing for improving speech quality can be found in telecommunication applications, e.g. mobile telephony, internet telephony or personal intercom. Blind methods are preferable over conventional because they do not require calibr...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2007